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Abstract 

The goal of this research is to clarify the development of metaphor by using structure-mapping theory 
to make distinctions among lands of metaphors. In partiailar, it is proposed that cUldren can 
understand metaphors based on shared object attributes bef(»re those based on shared relational 
structure. This ^edicts (a) earfy alnlity to interpret metaphors based on shared attributes, (b) a 
developmenlal inaease in alnHty to interpret metaphms based on shared relational structure, and (c) 
a shift from primarily attributional to primarify relational interpretations for metaphors that can be 
understood in dther way. Two esqperiments were performed to test these claims. There were three 
kinds of metaphors, varying in lather the shared mformation forming the basis for the interpretation 
was attributional, relational, or both. In Esqperiment 1, children aged 5-6 and 9-10 and adults produced 
interpretations of the three types of mets^hors. The attributionality and relationality of their 
interpretations were scored by independent ju^es. In Es^riment 2, duldren aged 4-5 and 7-8 and 
adults chose \diicfa ot two interpretations-relational or attributional-of a metaphor they preferred In 
both e?^riments, relational responding inaeased significantly with age, but attributional responding did 
not These results indicate a developmental shift toward a focus on relational structure in metaphor 
interpretation. 
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METAPHOR AS STRUCniRE MAPPING: THE RELATIONAL SHIFT 



The stu^ of metaphoric development is fraught vnth contradictions. Evidence that young children are 
quite poor at metaphoric interpretation sits ^de by ^de with equally compelling evidence that they are 
uniquely talented at metaphoric language. In e]q)erimental studies of metaphor comprehension, yoimg 
children ^ically perform quite poorfy. A 4*year-old asked "Can a person be sweet?" answers Uterally 
(c.g^ "Not unless he was made out of chocolate") (Asch & Nerlove, 1960). Young children do badly at 
matching sentences with metaphorically related pictures (Kogan, 1975) and at choosing appropriate 
metaphorical completions for sentences (Gardner, Krcher, Winner, & Perldns, 1975). Not until 14 years 
of age can children e3q>lam metaphors such as "Uie prison guard was a hard rock." These and many 
other e}q)erimental results seem to indicate that metaphorical atnlity develops very slowfy. Indeed, untU 
recent^, the dominant position was that metaphor devebps onfy after the child has acquired basic 
competence at literal language (eg., Inhclder & Piaget, 19^). 

This dour assessment contrasts sharpfy ^th the impression gained from children's spontaneous speech, 
which is so full of creative metaphors that some (^servers have viewed early childhood as a period of 
linguistic genius (Gardner, 1974). For example, a 15-month-old ^1 used "moon" to refer not only to 
the moon but to a half grapefriiit and a hangnail (Bowerman, 1982). A 2-year-old boy I observed 
remarked that a crescent moon was "bent, like a banana," and on another occasbn jumped into a pile 
of pillows and announced "leafs." It is unlUcefy that all such extensions could be accounted for as enors 
(Thomson & Chapman, 1975). This seems to leave us mth the paradox that young children can produce 
metaphors on their own but cannot comprehend them in experimental studies. Part of the disparity may 
be due to the kinds of tasks used to assess comprehension. For example, children can demonstrate 
metai^oric ability earlier with enactment tasks than with verbal e^lanation tasks (Gentner, 1977a, 
1977b; Pollio & Hckeus, 1980; Vosniadou & Ortony, 1986). But even with age-appropriate methodology, 
young children perform far less well in comprehension tasks than their fluent production would lead us 
to expect (e.g^ Gentner & Toupin, 1986). 

I suggest that an essential step in sorting out the devetopmental picture is dari^ng what we mean by 
"metaphor." The term covers a number of different kinds of comparisons, var^ng in their complexity 
and in the tature of the commonalities they convey (Lakoff & Johnson, 1^). In this report I use the 
structure-mapping theory of analogy to derive distinctions among kinds of metaphors (Gentner, 1980, 
1983, 1986). To preview the conclusions, I will argue that children can produce and comprehend 
metaphors based on common object attributes before metaphors based on common relational structure. 

Structure mapping. The bade intuition of structure-mapping theory is that an analogy is a mappmg 
of knowledge from one domain (the base) into another (the target) which conv^ that a system of 
relations that holds among the base objects also holds among the target objects. Tlius an ansdogy is a 
way of noticing relatbk.. i commonalities independent^ of the objects to vMc^ diose relations appfy. 
For example, Camot in 1824 explained heat flow by analogy with a waterfall The analogy conv^s tfiat 
just as a gradient from a high level to a low level will cause water to flow, given a path, so a gradient 
from a high temperature to a low temperature will cause heat to flow, given a heat path. Iliis is a 
typical analogy in that the higher order relational structures are identical in base and target if the prooer 
low-order correspondences among objects and functions are made (i.t.^ water — > heat; level — > 
temperature; and water path — > heat path). 

In interpreting an analogy, people seek to put the objects of the base in one-to-one correspondence 
^th the objects of the target so as to obtain maximum structural maich.^ The corresponding objects 
in the base and target do not have to resemble each other at all; object correspondences are determined 
by roles in the matching relational structures. Central to the mapping process is the principle of 
systematicify: People prefer to map systems of predicates linked by l^er order relations with 
inferential import rather than to map isolated predicates. The systematidty principle is a structural 
expression of our tadt preference for coherence and deductive power in interpreting analogy. Besides 
analogy, other kinds of similarity matches can be distinguished in this framework, according to whether 
the match is one of relational structure, object de^ptions, or both. As noted above, analogies discard 
object descriptbns and map relational structure. Mere appearance matches are the opposite: They map 
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aspects of object descriptions and discard relational structure. Literal similarity matches map both 
relational structure and object descriptions. 

Metaphor. Now let us appfy this framework to metaphor. Metaphors can be divided into four partially 
overlapping categories: attr3)utional metaphors, relational metaphors, double metaphors, and a category 
of complex metaphors that re^ anafysis as one-one mappings and that I will not con^der here 
(Centner, FalkenLdner, & Skorstad, IS^. Attributional metaphors' (e.g., '*Her arms were like twm 
swans**) are mere appearance matches: they convey common c^ject attributes. Here, the attributes 
long^** "thin," and "graceful** can be mapped from the base domain of swans to the target domain of her 
arms. Relational metaphors (e.g., Shakespeare's **Look, he's lading up the watch of his wit; by and by 
it will strike**) can be anafyzed as analogies: they convey that the base and target share common 
relational structure. Here, the intended commonalities have nothing to do with the object attributes of 
a watch-a glass face, metal cogs, and so on; instead, the metaphor conveys the common relational 
structure of a person setting a mechanism that will later produce seemingly spontaneous external efifects. 
Finalty^ besides pure relational and attributional metaphors, there are ako double metaphors that are 
mixtures of the two, such as Verbrugge and McCarrell's (1977) example, Tlant stems are drinking straws 
for thirsty trees.** This metaphor conveys both the common attributes **long, thin, tubular** and the 
common relational structure **r£uses fluids from a lower to a higher place in order to benefit some 
aeature.** 

Adults have been found to prefer relational metaphors over attributional metaphors and to focus 
primarily on relational commonalities in interpreting metaphors (Centner, 1986; Centner & Landers, 
1985). In a prior study on which the present research is ba^ed, subjects produced interpretations of 
metaphors and rated their aptness, having first written out descriptions of the objects contained in the 
metaphors (Centner, 1986). The results indicated a relational focus m metaphor interpretation in two 
ways. First, although the object descriptions contained both object attributes and relational information, 
only the relational information survived into the metaphor interpretations. For example, one subjea 
described "dgarette** as follows: **chopped cured tobacco in a paper roll/with or without a filter at the 
end/held m the mouth/lit with a match and breathed through to draw smoke into the lungs/founa 
widely among humans/known by some cultures to be damaging to the lungs/once considered beneficial 
to health.** This description contains both relational and attributional information. But the same person, 
when interpreting die metaphor ''Cigarettes are like time bombs,** included only relational information: 
They do their damage after some period of time during ^ch no damage may be evident.** The second 
indication of relational focus is that subjects considered metaphors more apt when they could find 
relational interpretations: Subjects' aptness ratings for metaphors correlated positively with the judged 
relationality of their interpretations. In contrast, attributional commonalities did not contribute to 
aptness: The correlation between aptness ratings and attributionality was nonsignificant and negatively 
trending. Adults thus demonstrate a relational focus both in interpreting and in jud^ng metaphor. 

Do children show this same relational focus? Although research on metaphorical development has 
not m general addressed this question, there is reason to suspect that they do not. Billow (1975) asked 
children to interpret two kinds of metaphors: **similarity metaphors,** based on shared attributes (e.g., 
"Hair is spaghetti") and "proportional metaphors," based on a class of shared relations (e.g., "My head 
is an apple ^^out any core"). Children 9-14 years old were much better at interpreting the similarity 
metaphors than the proportional metaphors. Fiirther, fgNtn similarity metaphors evcE S-year- 
olds could produce sensible interpretations almost 30% of the tune. (They were not given the 
proportional metaphors.) These results suggest that attribute matches may be developmentally prior 
to relational matches. If this is true, then many of the apparent paradoxes in the literature on metaphor 
development may be traceabk to (Ufiferent developmental patterns for different kinds of metaphorical 
matches. The present research investigates this possibility. 

Experiment 1 

Experiment 1 wa^ a straightforward extension of tiie method used in the Centner (1986) study. Children 
and adults interpreted meta|.hors of different types and rated the aptness of the metaphors. Their 
interpretations were then scored by independent judges for relationality and attributionality (see Centner 
& Stuart, 1983, for further details).^ There were three metaphor types: (a) attributional metaphors 
(e.g., Tancakes are nickels" [both are round and flat]); (b) relational metaphors (e.g., "A tire is a shoe" 
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[both are used by moving figures as points of contact with the ground]); and (c) double metaphors, with 
both attributes and rehdons in common (e.g.» Tlant stems are drinldng straws** [both are long and 
cylindrical; both are used to bring liquids from below to nourish a living thing]). 

If adults follow the predicted relational strategy, their responses should show three characteristics. 
I trst relational and double metaphors should be interpreted relationally. (This prediction does not 
apply to the attribute metaphors because they are designed not to have legitimate relational 
interpretations.) Second, adults should prefer metaphors that allow relational interpretations, so their 
aptness ratings should be higher for relational and double metaphors than for attribute metaphors. 
Hnally, adults' aptness ratings for metaphors should be positive^ correlated with Uie relationality of their 
metaphor interpretations but not with the attributionality. To the extent that children fail to follow a 
relational interpretation strategy, their results will differ from this pattern. 

The e3q)erimental procedure was aimed at minimiring the influence of other developmental changes 
besides metaphorical development The first potential confound is differences in pragmatic task 
kno^edge: Yoimg children may not know ^tn they are supposed to ffvt metaphorical answers. To 
circumvent such differences, a series of amplifications was used to make the task clear. A second 
possible confound is age differences in domain knovdedge. To minitniy^ such differences, only domains 
likely to be familiar to young children were used Hnally, to rule out differences in prior exposure to 
spedfic items, all comparisons were novel; no idioms or conventional metaphor? were used. 

Method 

Subjects. There were 10 subjects from each of three age groups: 5-6-year-olds (5 boys and 5 ^Is), 
9-10-year-olds (4 boys and 6 gjrls), and college students from psychology classes at the University of 
California at San Diego (7 males and 3 females). Children were recruited from schools in Del Mar and 
La Jolla, California. 

Materials and design. There were eight instances each of three types of metaphor, as shown in Table 
1: (1) attributional metaphors (A metaphors), in vAnch base and target shared many attributes but few 
relations; (2) relational metaphors (R metaphors), in which base and target shared relational structure 
but few attributes; and (3) double metaphors (D metaphors), in yMch base and target shared both 
relations and attributes, llius, each subject interpreted 48 objects and 24 metaphors. The design was 
Age (3 levels) x Metaphor T^pe (R, A, or D). 

[Insert Tabic 1 about here.] 

Procedure. The task was administered to the adults in written form, in groups. Subjects were first 
asked to describe the separate objects that later appeared in the metaphors. The objects were presented 
in semirandom order, with pairs from the same metaphor separated. All subjects, including children, 
were able to demonstrate basic familiarity with all of the terms used. After completing the object 
descriptions, subjects were ^ven the metaphors. They were asked to write their interpretations of what 
the metaphors were intended to convey and also to rate the aptness of each metaphor (i.e., how clever, 
interesting, or worthwhile it was) on a 1-5 scale. They also rated the metaphoridty of each comparison 
(i.e., the degree to svbich it expressed literal similarity as opposed to metaphorical relatc^ess). 
Metaphoridty ratings were not elicited from children and are not considered fmther here. Order of 
presentation was semirandom, with the constraint that metaphors of the same type were separated. 

The task was administered to children orally and individually, and responses were tape-recorded. 
Children were first asked to describe the 48 objects. Then they were asked to interpret the metaphors. 
A graded series of amplifications was used to be sure that children understood the task. Tht tedmique 
was essentially one of restating metaphors as similes, based on Reynolds and Orton/s (1980) finding 
that young children perform better with dmiles (**X is like a Y**) than with metaphors is a Y**). The 
experimenter would ask, **Is a hummin^ird a helicopter?" If the child responded literally (e.g., **No, a 
hummb^d is a bird"), the experimenter asked, "What does it mean if 1 5ay 'A hummingbird is a 
helicopter"?" If the child still responded literally, ihc experimenter asked, "What does it mean if I say 
'A hummingbird is like 2i helicopter*?" If the child m^ itmned a literal response after this third question, 
the literal response was recorded and the experimenter went on to the next metaphor. This tedmique 
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was quite r^uccessfiil, elidtiug nonliteral responses in all but a few cases. Moreover, after a few such 
sequences, most children caught on that a nonliteral response was desired in this task and thereafter 
produced nonliteral responses immediately for each new metaphor. After interpreting the metaphor, 
children rated its aptness by pointing to one of five schematic faces, ran^g from very sad (low aptness) 
to very happy (high aptness). They were told to point to the happy face if they thought the metaphor 
was mteresting and to the sad face if they thought it was dull or boring. 

Scoring. The purpose of the scoring was to provide an unbiased assessment of the commonalities 
selected by the subjects in their metaphor mterpretations: in particular, Aether these commonalities 
were rdationa! or attributionaL Groups of from two to four advanced undergraduates who had received 
training in the use of proportional notation rated each metaphor interpretation on two separate 5-pomt 
scales, an attributionality scale and a relationality scale. Attributionality and relationality were scored 
on separate scales to allow for the pos^bility that the same response might contain both kinds of 
information. For the attributionality rating, judges rated whether the response described obj'~':ts in 
and of themselves (e.g., **X is yellow,** or **X is a sphere**). Most adjectives received high attributionality 
ratings, as did many concrete nouns. For die relational!^ rating, the judges scored whether the response 
described relations between objects (or, m the case of higher order relations, between other relations 
in the domain), Tran^tive veibs (e.g., **X jhases Y,** or **X causes Y**), comparative adjectives (e.g., *'X 
is longer than Y**), and prepositions (e.g., **X is inside Y**) tended to receive high relationality ratings.^ 
The specific content of the relations could vary widely, mcluding spatial, causal, functional, and other 
kinds of relations. 

Two further pomts must be made about the scoring. First, to minimize the effects of diffierences in 
length of responses, an interpretation was given a h^ rating on relationality if it included any clearly 
relational statement, and similarly for attributionality. This method is sen^tive to the presence or 
absence of relational (or attributional) information but not to the number of different relations (or 
attributes) mentioned. Second, because of the partiy subjective nature of the scoring task, it was 
important to remove potential sources of bias. Therefore the judges were not told the su^ects^ aptness 
and metaphoridty ratings, and only one of the judges was aware of the experimental hypotheses. To 
conceal the ages of the subjects, the metaphor interpretations were scored m random order. Interrater 
agreement ranged from 85% to 100% on different metaphors. 

Results 

There was a marked developmental increase m the use of relational information m the metaphor 
interpretations. Table 2 shows sample mterpretations produced at different ages, as well as the ratings 
they received. The mean rating of the relationality of metaphor mterpretations mcreases steadily with 
age for both the relational (R) and double (D) metaphors QFig. 1, top). (This mcrease does not occur 
for the attributional (A) metaphors because they do not permit relational mterpretation.) In contrast, 
the attributionality of the metaphor mterpretations remains constant across age for all three classes of 
metaphor (Fig, 1, bottom). In short, there is a specific developmental mcrease m the relationality, but 
not the attributionality, of the metaphor mterpretations. 

[Insert Figfire 1 about here.] 

[Insert Table 2 about here.] 

Two separate two-way 3 (Age) x 3 (Metaphor Type) analyses of vari^^nce were performed, one for the 
relationality ratings and one for the attributionality ratings. In the relationality analysis, there was a 
main effect of age, confirming the developmental mcrease in use of relational information in mterpreting 
metaphors, F(2^ = 12.76, p < .01. The interaction of Age x Metaphor Type was also significant, 
refle^ing die fact that the age increase in relationality occurs only for the R and D metaphors, F(4,S4) 
= 5.48, p < .01. In the attributionality analysis, neither the mam effect of age nor the Age x Metaphor 
Type interaction was significant, confirming that there was no developmental mcrease in attributionality. 
In both the relationality and attributionality analyses, the main effect of metaphor type was strongly 
significant, F(2,54) = 191.63,p < .001,^(2^4) = 265.06 p < .001, respectively. For all ages, Uie a priori 
categorization of stimuli held up well: on relationality, R metaphors were highest, followed by D and 
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then A metaphor^ and on attributionality^ A metaphors were highest, followed by D and then R 
metaphors. 

The performance on D metaphors is of special interest. By design, the D metaphors could support 
either an attributional or a relational interpretation. The results show a relational bias m the two older 
groups but not in the youngest children. Planned comparisons between Uie relationality ratmg^> and the 
attributionality ratings of the D metaphors showed that^ for the 9-year-olds and adults, the mean 
relationality is significant^ greater than the mean attributionality, t{9) = 2.78, p < .05, and t{9) = 3.79, 
p < .05, respectively. For Sie 5-6-year-olds, there is no significant (Ufference, t{9) = 1.93, p > .05. 

Aptness ratings* As predicted, adults preferred metaphors for \^ch they could find relational 
mterprelations. Adults' mean aptness ratings for R{M = 2.86) and D{M » 2.95) metaphors were 
considerably higher than for A metaphors (Af « 230), r(79) = 430, p < .001, for D versus A, and 
^(79) - 3.87,/? < .001, for R versus A metaphors. Children did not ^ow this relational preference. 
For 5-6-year-olds, M « 2.08, 2.14, and 226, and for 9-10-ycar-olds. M = 1.99, 2.18, and 2.19 for R, D, 
and A metaphors, respectively. Children's aptness ratings did not differ significant)^ across the three 
types of metaphors. Another indication of relational focus in adults is that their aptness ratings were 
positive^ correlated with the relationality of their interpretations and negatively conelated with the 
attributionality, r (22) - SS^p < .01, an^ r (22) « -.42, p < .05, respectively. CUldren did not show 
this pattern. Their aptness ratings showed no correlation with either relationality, r (22) - .15 for 5- 
year-dds and -.18 for 9-year-old^ or attributionality, r (22) = .01 for 5-year-olds and .29 for 9-year- 
olds. We must be cautious in evaluating the aptness data because the children may simply have ladced 
fedlity with the aptness scale. Still, the results suggest a developmental difference in the criteria for 
judging aptness in metaphor. 

Discussion 

As predicted by structure-mapping theory, adults focused on mappmg relational systems. There are 
several indications of this pattern. First, adult interpretations for R and D metaphors were hi^y 
relational; this is especially telUng for D metaphors, ^ch could have supported either kind of 
mterpretation. Second, adults rated the R and D metaphors as more apt than the A metaphors. Third, 
adults' aptness ratings correlated positive^ with the relationality of their interpretations but negatively 
with attributionality. This pattern replicates and strengthens the positive conelation between aptness 
and relationality found in prior research wiUi adults (Centner, 1986). Adults appear both to seek 
relational predicates in metaphorical mapping and to judge the aptness of a comparison according to 
the relationality of the mapping. 

Young children did not share this relational focus. The 5-6-year-olds produced fewer relational 
mterpretations of R and D metaphors than the older groups, and they sho^;^ no tendency to produce 
relational rather than attributional interpretations for the D metaphors. There was a marked increase 
m relationality with age. This developmental trend was specific to relational information; there was no 
age increase in attributionality. Turning to aptness, one might have suspected that children, though not 
fluent enough to produce relational interpretati(»is, would nevertheless show an appreciation of 
relationality in their aptness ratings. But this was not the case. Children's aptness ratings reflected no 
preference for R and D metaphors over A metaphors. Further, there ym no conelation between 
children's aptness ratings for metaphors and the relationality of their own mterpretations. 

Overall, these results appear to show a clear developmental increase in relational focus. But there 
were some possible problems with the methodology used in Experiment 1. One limitation was Uie use 
of verbal explanations as responses. Children's ability to give an explicit veibal account of their 
kno^edge tj^icalty lags behind their ability to demonstrate that knovdedge in other ways (e.g., Poliio 
& Pickens, 1980; Vosniadou & Ortony, 1983, 1986). A second concern was the aptness-rating task. 
As discussed above, it was not clear that children understood the sad-to-smiling face scale. To address 
these problems, a second e]q)eriment was performed using a dioice task rather than a production task. 
Children were presented with two Interpretation-one relational and one attribuUonal-for each metaphor 
and were asked to choose the inteipretation they preferred. In this way, young children could express 
a preference for relational interpretations without having to generate diem. After this choice task, a 
rating task was given, using a new method to elicit the ratings. Children were asked to indicate their 
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ratings on a **goodness meter/ a vertical scale something like a thermometer. Fmally» one other 
difference between the two studies was that, because E^^^riment 2 used a comprehension paradigm, 
slight^ younger children could be tested thaln in Experiment 1. 

Experiment 2 

Method 

Subjects. There were 36 adult subjects, all Univer^ty of Illinois undergraduates, and 36 children: 18 
4*S-year*olds, with a mean age of 4.10, and 18 7-8-yearH>lds, with a mean age of 7.11. They were 
approximate^ evenly divided between males and females* 

Materials. Eighteen metaphors were used: eight R metaphors, eight D metaphors, and tv/o A 
metaphors* The metaphors were taken from Ea^riments 1 (see Table 1), except that two of the D 
metaphors from Esqperiment 1 v^re judged unsatisfactory and replaced: the c?etaphors "A lake is like 
a mirror" and "Stars are like diamonds** were replaced by "Blood is like water" and "A submarine is like 
a ^lale." Because it proved to be difficult to design plau^le relational interpretations for the A 
metaphors, only two A metaphors were used» both taken from Es^riment 1: "Jelfy beans are like 
balloons" and "A snake is like a hose/ Due to experimenter error, the two new D metaphors were 
omitted in the adult version of the choice task, leaving only 16 metaphors: eight R, six D, and two A 
metaphors. 

For each metaphor, two relational and two attributional interpretations were devised, based on the 
mterpretations generated by subjects in E)q)eriment 1. Three criteria were applied in selectmg among 
the responses generated in Esqperiment 1. First, to be sure that the responses were either dearly 
relational or dearfy attributional, the selection was limited to interpret&dons that had received scores 
of 4 or better from the judges in E}q)eriment 1. Second, among the interpretations that met the first 
criterion, the two most frequent relational interpretations and the two most frequent attributional 
interpretations were selected Third, because it was crucial that young children understand the choices, 
in tallying the responses those of S-6*year-olds were given the most wei^t* For a few metaphors, there 
were not enough responses from Esqperiment 1 that met the criteria. In this case, I either used a less 
frequent response or invented an extra response to bring the total up to two attributional and two 
relational interpretations* Responses were reworded to simplify any difficult language. Table 3 shows 
samples of the metaphors and interpretations used. 

[Insert Table 3 about hcre*l 

Design and Procedure 

In both the choice task and the rating task, the key factors were age (3 levels), metaphor type (R or 
D, ^th A considered separate^), and interpretation type (relational or attributional). For adults, tlic 
tasks were administered in written form to two separate groups of 18 subjects. For the choice task, each 
metaphor was g^n with its two interpretations and subjects indicated which interpretation they 
preferred For the rating task, each metaphor was given on a page with its two interpretations, and 
subjects were asked to rate how well they liked each interpretation on a 1-5 scale, with S being "very 
interestmg" and 1 being "boring." In both tasks, the list o metaphors was presented twice, once with 
each set of interpretation pairs. Two different orders were used, one the reverse of the other. 

Children were tested individually in their homes. They received the choice task followed by the rating 
task* A quiz-show format was used to sunplify the task. Two stutfed dolls v^re used as quiz show 
contents. Each doll had a canister to collect its winnings. The diild was told that Pmk Panther and 
Tweetie Bird were competmg on a quiz show, and that it was the child's jc^ to indicate v^di doU had 
given the best answer by dropping a token into the imning doll^s canister. To be sure children 
understood the format, each child completed two practice tasks before starting the actual experiment. 

Phictice tasks* The first practice task was picture labeling. The experimenter held up a picture of a 
familiar animal and e)q;>lained that the goal of the contestants was to ^ve its common name (e.g., 
"elephant"). One doll produced the correct answer and the other a dearly incorrect answer. The child 
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indicated i^ch doO was correct by dropping a token in that doU^s canister. In this and all succeeding 
tasks, each doll was correct half Jie time, in a semirandom predetermined order. The second practice 
task was sentence interpretat](Hi« A ^ple sentoice was read and each doll gave an mterpretation of 
the meaning of the sentence. One interpretation was always correct and the other clearly wrong. For 
example, thi sentence mi^t be **Santa Gaus is on the roof and the two interpretations: ^There's a joUy 
fat man on the roof (correct), and **I had spaghetti for dinner^ (wrong). The child indicated the correct 
answer in the manner descr3)ed ?bove» Tlie criterion for pasdng the practice tasks was ^ correct in 
no more than 10 trials. In fact, n ost children achieved criterion in fewer than eight trials. Only one 
child-a 4*yearH>ld-had to be diojptd from the ei^eriment due to Culing the practice task. After the 
practice tasks, the experiment bc^an with the choice task for metaphor interpretations. 

Choice task. The oqperimenter read the ''quiz question'' metaphor for example, **How is a cloud like 
a sponge?* Then the esq^erimenter would point to each doU in turn and gh« the doU's respo^^ ''He 
says, ^Because the/re boA soft and fluff^; and he says, They both hold water.' Whose ans^/er is 
better?" The child would give a token to the winmng doQ. In all cases, (Hie doll gave an attributional 
interpretation and the other a relational initerpretation, with the asugnment varying such that each doll 
bad equal numbers of relational md atttibudonal answers. Order ot the relational and attributional 
mterpretation was varied so that each occurred first half the tune. After all 18 metaphors had been 
presented with the first set of interpretations, they wei^ presented agam with the second set of 
interpretations. The order in whidi th^ relational and attributional interpretations was given on the 
second pass was reversed from the cider on the first pass. 

Ratlngtask. The eiqierimenter b^an by saying, T^ow we're going to do something else. Youpicked 
^ch answer you liked better. Noiy you're going to tell me just how good you think each answer is." 
Then the child was shown the "goodness meter," a vertical scale standing about 2 feet high and 3 inches 
wide, with gradations running from 1 (lowest) to S (highest). The e3q)erimenter pointed to the numbers 
and described thdr use: "A S means that the answer is realty good. But a 1 means that it's reaUy bad. 
You didn't like the answer at all Now 3 is in the middle. It means that the answer was OK." \^en 
the child could demonstrate kno^i^edge of the scale, the rating task began. The e}^rimenter read the 
"quiz question" metaphor (e.g^ "How is a cloud like a sponge?") and gave the first doU's response: "He 
says, 'Because the/re both soft and fluffy.' How good is that answer?" After the child rated the answer 
on the goodness meter, the esqperimenter repeated the metaphor and gave the second doU's response, 
^ch the child again rated. 

Deflnltion task. At the end of the session, as a check on their knowledge, children were asked to 
define the words used in the metaphors. For younger children, all words were tested. For older 
children, only more difficult words (e.g^ "submarine") were tested. If children had difficulty with the 
initial questions (e.g., "What is a sponge?"), the question was changed to "What do you do with a 
sponge?" All children demonstrated adequate un(ferstanding of the words. 

Results and Discussion 

Choice taslc. Relai uonal responding inaeased with age, as shown in Figure 2. 

[Insert Figure 2 a^MMit liert.) 

The proportions of relational responses for the R and D metaphors were .61, .69, and .89 for the 4*5- 
year*dds, 7-8-year-olds, and adults, respectively. A 3 (Age) x 2 (Metaphor T^pe) mixed-measures 
analyius of variance on the proportion of relatbnal responses for the R and D metaphors showed a main 
effected age, confirniing the devek)pmental increase in relational preferen^ » iO.Sl,p < .0001. 
(However, all age groups showed above-chance relational re^Kmding pColgoroff-Smimov goodness-rf- 
fit with chance at all je*s > l.SO,p < .0S]>) There was also a main effrxt of metaphor type, reflecting 
a stronger relatbnal preference ob R metaphors (M » .77) than on D metaphors (M » .67), F(l,51) 
» MSf^p < .001. Tlie interaction between age and metaphor type was not significant 

Rating taslc There was an age increase in relational bias on the R and D metaphors. As shown in 
Figure 3, the mean rating of the relational interpretation inaeases with age, and the mean rating of the 
attributional interpretation deaeases with age. These patterns were confirmed m a 3 (Age) x 2 
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Metaphor T^P^) x 2 (Interpretatioa Type) imxed*measures anafysis cf variance over the R and D 
meUphors. As predicted, there was a maia effect of interpretatioa typt^ F(l^l) » 95JSS,p < *0001, 
coniirming that ratings were hi^ier for reladonal interpretations than attributioaal interpretations* 
There was also a aiain effect q( agc» reflecting hij^ ovmll ratings for the 7-S-year-olds tlum for the 
other two groups, F(2|S1) > 6^,p < *0L The ej^ected interaction of Age x Interpretation Type was 
signtficant,F(23l) « 30^,p < *0001, reflecting both the rise in ratings for the rehtionaimterpretat:^ 
and the dn^ in ratings for the attribudonal interj^etaticms with age* There was also a Metaphor x 
Interpretati<m Interaction, F(l,51) » 13*94,P < •001, indicating a somewhat greater relational preference 
on the R metaphors than on the D metaphors. There was no main effect of metephor typft and no other 
significant interactions. 

[Insert Figiire 3 aboot hm*] 

As shown in Figure 3, relaticmal interpretations become steadily more valued ^th age and ^ttributional 
mterpretations become less valued. Planned comparisons confirmed Uiat the older groups rated 
relational interimtations agnificantly higher than attributionat interpretations. For R metaphors, the 
difference was »gnificant for both 7-8*year-olds and adults, but not for the 4*S*year-oIds, 1(17) > SJSO, 
p < .0001; r(17) « 10.17,p < SXJOl; and ({IT) « t33,p > .05, respectively. For D meUphors, only the 
adults showed a significant difference, t{17) > 939, p < .0001; neither the 4-S*year-olds nor the 7-8- 
year-olds showed a Significant relational preference on D metaphors, 1(17) < 1, and ({17) > 1.89, p < 
.08, respectively. 

Attribotional metaphors. On the A metaphors, subjects showed a clear preference for the attributional 
resp(Hises, as e3q)ected. In the choice task, all groups chose the attributional response more often than 
the relational response on the two A metaphors (see Fig. 2). Similarly, in the n ting task all age groups 
rated the attributional respcmse hi^er than the relational response on the two A metaphors (see Fig. 
3). This difference was ugnificant for 7-8*year-old$ and adults, 1(17) « -2.71, p < .05, and 1(17) > 
-4.91, p < .0001, respectively, although not for 4-5-year-dds, t(17) » -1.8%p < .09. Performance on 
the A metaphors helps rule out some artifactual explanations of the results. The apparent relational 
preference on R and D metaphors might have been based on some response hL^ such as a p. ^ference 
for longer' or more abstract-sounding mterpretations. However, the subjects* preferen^^ !or attributional 
mterpretations on the A metaphors tends to mdicate that their preference for relational interpretations 
on the R and D metaphors was a genuine interpretational choice. 

General Discussion 

The results of the two e^qwriments show a developmental mcrease m relational focus in the 
interpretation of metaphors. As m prior research (Centner, 1986), adults showed a strong rrhtional 
bias. In E)qperimcnt 1, they generated relational interpretations ii^enever possible, and uey rated 
metaphors as more ;ipt fhc more relational commonalities they found and less apt ^he more attributional 
commonalities !!iey found. In &qperiment 2, on both the dioice task and the rating task, adults showed 
a c!<:ar preference for relational mterpretations for relational and double meta;ihors. 

This relational focus developed gradually. In E3q)eriment 1, the rt;lationality of the interpretations 
produced for relational and double metaphors increased steadily from 6 years to adulthood. E)q)eriment 
2 showed the same development^*! pattern for comprehension. Both on the choice task and on the 
ratings task, tfiere was a developmental shift toward preferring relational interpretations over 
attributional interpretations of relational and double metaphors. In contrast, on the attributional 
metaphors, all three age groups prefened attributional interpretations both in production and in 
comjMrehension; there was no developmenta! change wi j^isc^r. Tbi^ indicates (a) that the 
developmental shift is specifically relational and not simply the result of scne general increase in 
verbiage vnAi age, and (b) that not all metaphors are alike developmentaUy. Indeed, young children 
seem to deal with attribute matches in much the same way as adults. What develops is relational 
similarity. 

The notion of a relational shift helps clarify the conflicting findings in the development of metaphor. 
For example, the paradox of production before comprehension may admit of the follo\ring resolution: 
children both produce and comprehend attributional metaphors very early^ but relational matches come 
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later. Because comprehension tasks typically include both kinds of metaphors, children do poorly in 
these tasks iwcll after they can spcntaneousfy produce attributional comparisons. A supporting 
observation is that most of Uie earfy spontaneous metaphors appear to be based on perceptual attribute 
overlap, as in Bowerman's "moon-hangnail" example quoted above. In Winner's (1979) study of the 
spontaneous metaphors of one child (Adam) from age 23 to 410, the most frequent basis for 
metaphorical comparison was the shared object attribute oi contour, for eicample, referring to a pendl 
as a Ing needle." Relational metaphors occasionally occurred, especial^ during pretending games (e.g., 
at 33 Adam placed a small alphabet letter on top of a big one and said, "Adam sleeping on Dadd/X 
but they were rare. 

There is some prior research supporting a relational shift* In the study by Billow (1975) discussed 
above, metaphors based on shared &^<^Ibutes (e.g., "He had a pickle for a nose") were comprehended 
well before "proportional metaphors," based on shared relations (e^ *The su«t wakes the seeds"). The 
relational shift is also onnpatible with Keil's proposed shift frcmi characteristic to defining features in 
category judgments (KeQ & Batterman, 1984). Typicalfy, characteristic features are object attributes, 
such as the shape of an animal, and defining features are relations, such as the causal regularity that 
parents produce children X)f the same species as themselves. Another antecedent is Bruner's proposal 
that children move from relying on perceptual^onfigural information to refying on ftinctional 
information (Bruner, Olver, & Greenfield, 1966). 

The notion of a relational shift allows a more orderfy account of the development of metaphor. But 
wbal underlies this shift? At least three possiUlities are suggested by the literature. The first pos^ility 
is that it reflects an increase in basic cognitive competence (Inhelder & Piaget, 1958; Pascual-Leone, 
1970). Thus, Billow (1975) interpreted findings in terms of cognitive stages: similar (attributionaQ 
metaphors are early because they can be understood at the concrete operational level; proportional 
(relational) metaphors are acquired later because they require formal operational thinking. The second 
possibility is that the relational shift reflects children's learning of adult pragmatic conventions 
concerning ^^t to map. If children did not share the aesthetic that metaphors are supposed to be 
about shared relations, then their failure to perform relational mappings would not necessarily indicate 
a lack of competence. The third possibility is that the relational ^dft results from acaetion of domain 
kno^edge; it is an instance of the no^ce-e}q>ert shift (Carey, 1984; Chi, Feltoi^ch, & Glaser, 1981; 
Larkin, 1983; Siegler, 1976). By this account, children's failure to perform relational mappings results 
not from cognitive deficiencies but from a lack of kno^^edge of the necessary domain relations. 

Although there is as yet no definitive e^dence, there is one line of comparison that can differentiate 
among these three approaches. The first two accounts-the cognitive competence account and the 
pragmatic insight account-both postulate a global shift that applies aaoss domains. In contrast, the 
domain-knovdedge account postulates that the relational shift occur at different times in different 
domains. Moreover, il should occur earliest in those domains that are most accessible to youiig children. 
This position is supported by the fact that the few studies that have shown earfy atnUty to perform 
relational mappings have used extreme^ familiar domains (Brown & Campione, 1985; Crisafi & Brown, 
1986; Gardner, 1974; Gentner, 1977a, 1977b). For c^jnple, in an earlier study I found that very young 
children could interpret metaphors \rith body parts as the base domain (Gentner, 1977a, 1977b). 
Children were shown pictures of familiar objects such as trees and asked questions like, "If the tree had 
a knee v^ere would it be?" Children as young as 4 could accurately map the familiar spatial relations 
among body parts, even ^en the task was made more difficult by turning the tree upside down or by 
adding misleadmg details. Similarly, Crisafi and Brown (1986) found that preschoolers could learn to 
do a difficult combination-of-inferences problem by analogy with an easier problem, but only if the base 
problem was made up of extremely familiar objects and activities. 

By the domain-kno^edge account, the relational shift should occur early for some domains and late 
for others. DeLoache's (1985) work on search beha^or provides intriguing e^dence for a very early 
relational shift in spatial matching ability. At 3.1, children can solve problems in ^ch they see an 
object hidden in a furnished room and then look for a tiny replica of the object in a tiny replica of the 
room. But at 2.7 children fail this tasl^ they can successfully find the object only if the two rooms are 
identical. Perhaps it is during this interval that understanding of spatial relations progresses to the pomt 
where children can match on the basis of relational structure without the support of object identity. At 
the other end of the developmental spectrum, Sternberg and Downing (1982) found that adolescents go 
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throu^ some of the same stages of mterpretation y^bcn dealmg with complex analo^es between 
ando^ that younger diildren do wUh ample analogies. 

The domain-knowle<^ account of the relational sluft still faces unanswered questions. In order to 
account for the data, the donuun-faiowk<^ account must presuppose that in eveiy donsun relational 
informaticm is slower to be acqmred than olject-attribute informati(Ht This may be a plau^le intuition, 
but it has not been verified A second problem is spedfynig ifdiat omstitutes a domain* Despite these 
unanswered questions, the domam-kno^^rfedge account is tte best current candidate for e}q)laining the 
relational shift. 
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Footnotes 



^e version of structure mapping presented here is a descriptive theory. A computer simulation 
based on a process model also exists (Falkenhainer^ Forbus, & Centner^ 1986; Centner, Falkenhainer, 
& Skorstad, 1987). 

^or present purposes, metaphor ("An X is a Y*) and simile (**An X is like a Y") vnH be classed 
together. 

^This e)q)eriment was conducted by P. V. Stuart as an undeigraduate honors project at UCSD. 

^Althou^ most scoring ded^ons were straightforward, there were some ambiguous cases. These 
generally involved relational adjectives such as edible and sopmfic or relational nouns such ^s father. 
These terms express relational information, yet they appear on the surface as adjectives (Miller, 1979). 
For OQunple, the adjectival phrase "X is soporific" is on the surface a one-place predicate (soporific 
[X]). But its meaning is the relational proposition "X puts people to sleep." Because of their ambiguous 
status, such relational adjectives were scored as intermediate (3) on both the relational and the 
attributional scale. 

^e relational responses were in general longer (iu number of words) than the attributional 
responses, so a length tnas would have iiilated the number of relational responses. As another check 
on this possilnlity, I compared the results for four R and D metaphors that had the reverse pattern- 
attributional responses longer than relational responses-with the overall results for R and D metaphors. 
There were no systematic differences in the results. 
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Table 1 

Materials Used in Experiment 1 



Metaphor type 



Examples 



Relational metaphors 



Attributive metaphors 



Double metaphors 



The moon is like a lightbulb. 
A camera is like a tape recorder. 
A ladder is like a hill. 
A cloud is like a sponge. 
A roof is like a hat. 
Treebark is like skin. 
A tire is like a shoe. 
A ^dow is like an eye. 

Jelly beans are like balloons. 
A doud is like a marshmallow. 
A football is like an egg. 
The sun is like an orange. 
A snake is like a hose. 
Soapsuds are like whipped 

cream. 
Pancakes are like nickels. 
A tiger is like a zebra. 

A doctor is like a repairman. 
A kite is like a bird. 
The sky is like the ocean. 
A hummingbird is like a 

helicopter. 
Plant stems are like drinking 

straws. 
A lake is like a mirror. 
Grass is like hair. 
Stars are like diamonds. 
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Table 2 

Sample Interpretations of Different Classes of Metaphors in Experiment 1 



Age 



Attributional Metaphor 
(The sun is an orange.) 



Relational Metaphor 
(A tire is a shoe«) 



Double Metaphor 
(Plant stems are 
drinking straws.) 



3*6 • a I 



9-10 



Adult, 



They're round and 
orange. (1^ 



It's like a circle and so 
is the sun. (1^) 



Both are orange; both 
spherical. (1^) 



Sometimes your shoe 
is black and the tire is 
black. (14) 

You can go places on 
both. (5,1) 



Both are coverings 
that come m direct 
contact v^ith the 
terrain. (5,1) 



They're both straight. 
(14) 



Plant stems are thin 
and so are straws. 

(14) 

They are both used for 
drawing in water. 
They are both tubular. 
(5,5) 



Note. The figures in parentheses give the rated relationality and attributionaiity of the responses. 



20 



Table 3 

Sample Metaphors and Interpretations Used in Experiment 2 



Samples Interpretations 



Relational metaphors: 
A doud is like a 

sponge Rl) Both can hold water Al) Both are soft 

R2) Both can give off water A2) Both are flufify 

A tire is like a 

shoe Rl) Both are uc: move something Al) Both are made of rubber 

R2) Both cover the bottom of sometUng A2) Bot'a can be black 

Double metaphors: 
Plant stems are like 

drinking straws. . Rl) Both can be used to get water Al) Both are long 

R2) Both draw liquid up A2) Both are thin 

Grass is like hair . Rl) Both grow quickly Al) Both are long 

R2) Both cover and protect something A2) Both are skimiy 

Attributional metaphors: 
Jelly beans are like 

balioons Rl) Both are fun at parties Al) Both can be different colors 

R2) Both can be tossed in the air A2) Both are round 

A snake is like a 

hose Rl) Both squirt liquid Al) Both are long 

R2) Both can curl up A2) Both are wi^ey 



Note. Rl and R2 denote relational interpretations, and Al and A2 denote attributional interpretations. 
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Figure Captions 

Figure la. Results of Experiment 1: Mean ratings of relationaiity for interpretations of dificrent types 
of metaphor across age. 

Figure lb. Mean ratings of attributionality for interpretations of different types of metaphors across age. 

Figure 2. Results of Dqieriment 2: Proportion of relational interpretations chosen for different types 
of metaphors across age. 

Figure 3* Results of Experiment 2: Mean ratings of relational and attributional interpretations for 
different types of metaphors across age. 
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